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Abstract 



> 

. Let n, r, fc be positive integers such that 3 < k < n and 2 < r < k — 1. Let m(n, r, fc) denote the 

' maximum number of edges an r-uniform hypergraph on n vertices can have under the condition that any 

collection of i edges, span at least i vertices for all 1 < i < k. We are interested in the asymptotic nature 
of m(n, r, k) for fixed r and k as n — > oo. This problem is related to the forbidden hypergraph problem 
introduced by Brown, Erdos, and Sos and very recently discussed in the context of combinatorial batch 
' codes. In this short paper we obtain the following results. 

(i) Using a result due to Erdos we are able to show m(n,k,r) ~ o(n r ) for 7 < k, and 3 < r < 
k — l— [log k~\ . This result is best possible with respect to the upper bound on r as we subsequently 
show through explicit construction that for 6 < fc, and k — [log k~\ < r < k — 1, m(n, r, k) = Q(n r ). 
This explicit construction improves on the non-constructive general lower bound obtained by 
Brown, Erdos, and Sos for the considered parameter values. 

(ii) For 2-uniform CBCs we obtain the following results. 

(a) We provide exact value of m(n, 2, 5) for n > 5. 

(b) Using a result of Lazebnik et al. regarding maximum size of graphs with large girth, we 
improve the existing lower bound on m(n,2,k) (^(n 1 ^ 1 )) for all fc > 8 and infinitely many 
values of n. 

(c) We show m(n, 2, fc) = 0(n L 4 J ) by using a result due to Bondy and Simonovits, and also 
show m(n, 2, fc) = <d(nz ) for fc = 6, 7, 8 by using a result of Kovari, Sos, and Turan. 

1 Preliminaries 

Hypergraphs and Turan numbers 

We briefly make some definitions, and set up our notation. 
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A hypergraph F is a tuple F := (V, J-), where V is a set of vertices and J 7 is a family of subsets of 
V. Sets of T are called edges of the hypergraph and cardinality of T is called size of the hypergraph. A 
hypergraph T is called r-uniform if each of its edges has cardinality r. For a vertex x € V, we will denote 
by deg(x), the number of edges in T containing x. Further, by Kn we will denote the complete r-uniform 
hypergraph on n vertices, and by K^ r '(l, . . . , I) we will denote the complete r-uniform r-partite hypergraph 
with I vertices in each part. We denote by K{s,t) the complete bipartite graph with partition sets of size 
s and t respectively and by Q, a cycle of length i. 

Let T~L be a family of r-uniform hypergraphs. By the Turdn number of the family % denoted by ex(n, T~L), 
we mean the maximum size of an r-uniform hypergraph on n vertices that does not contain a copy of any 
of the hypergraphs of H as a subgraph. 

Combinatorial Batch Codes and an extremal problem 

The notion of Batch Codes was introduced in [15] as an abstraction of a load balancing problem in a 
distributed database setup. Loosely speaking, an (m, NJs, n, t j3-batch code models the problem of storing 
m data items into n servers in such a way that any k Hpf the n data items may be retrieved by reading 
at most t items from each server and the overall storage to be limited to N. Combinatorial Batch 
Codes (CBCs), also introduced in [15], and subsequently studied in [T7], [5], [TJEIE], PQ, is a subclass 
which models the scenario when each of the N stored data items is a copy of each of the m input data 
items, i. e., the m input data items are replicated among n servers. This restriction makes the problem a 
purely combinatorial one which can be studied in the setting of a hypergraph. Without providing further 
details (cf. [13 [T]) we state the following theorem of [17] which characterizes CBCs in the setting of a 
hypergraph. 

Theorem 1.1 ([H]). A hypergraph (V,J-") represents an(m,N,k,n)-CBCif\V\ = n, [J 7 ! = m, X^FejH-^l = 
./V and every collection of i edges of T contains at least i vertices for 1 < i < k. 

Henceforth, we will refer to the hypergraph representing a CBC as a CBC with corresponding param- 
eters. Naturally a CBC will be termed r-uniform if the corresponding hypergraph is r-uniform. 

The problem that we will address in this article is that of maximizing the number of data items (m) of 
a uniform CBC for given values of the number of servers (n) and retrievability parameter (k). Equivalently, 
and more formally we have the following extremal problem. 

Let n, r, k be positive integers such that 3 < k < n and 2 < r < k — 1. Determine m(n, r, k), the 
maximum number of edges a r-uniform hypergraph on n vertices can have under the condition 
that any collection of i edges, 1 < i < k, span at least i vertices. 

This is a forbidden hypergraph problem, where we have the following family of forbidden hypergraphs. 

rl r (k) = {H : H is an r-uniform hypergraph with i edges and < i vertices for 1 < i < k} (1) 

*In batch code literature, number of data items is denoted by n and number of servers is denoted by m. In this article, we 
deviate from this and reverse the roles of m and n to make it consistent with the common notations used in the hypergraph 
setting 

^In this paper, we will refer to this parameter as the retrievability parameter. 

*In this article, we will exclusively consider the case of t = 1 as that seems to capture the essence of the problem and this 
is the case for most of the work done in this area. So, henceforth, we will not mention this parameter in any expression with 
the understanding that t — 1 case is considered. 



2 



We are interested in the asymptotic nature of m(n, r, k) = ex(n, H r (k)) for fixed r and k as n — > oo. 
This type of extremal problem was introduced by Brown, Erdos, and Sos in [4] , where the authors considered 
as forbidden family the following family of hypergraphs. 

W r (p, q) = {H : H is an r-uniform hypergraph with p vertices and q edges } (2) 

They showed, through non-constructive arguments, the following lower bound. 

ex(n,H r {p,q)) = n(n^) (3) 

We make the following observations to simplify the problem that we are considering in this article, i. e., 
finding the asymptotic nature of m(n, r, k) for fixed r and k as n — > oo. 

Remark 1.1.1. 1. The parameter r considered in this article is a constant independent of n, and since 
we are interested in the asymptotic nature of m(n,r,k), we shall assume that all the hypergraphs 
studied in this article do not have multiple copies of any r-set, i. e., the considered hypergraphs are 
all simple. Indeed, observing that for the forbidden hypergraphs considered above, any edge can have 
at most r copies, we are guaranteed to have at least a fraction of £ of the maximal possible size of 
an extremal hypergraph with repeated edges, and hence the asymptotic estimate for m(n, r, k) does 
not change. 

2. If a collection of J~i of i distinct edges span < i vertices, then there is a subcollection J-j C T{ of j 
distinct edges that span exactly j — 1 vertices. 

3. Though it does not find any significant role in the sequel, we mention here for the sake of preciseness 
that in a simple hypergraph, a collection of 2 distinct r-edges span at least r + 1 vertices and a 
collection of r + 2 distinct r-edges span at least r + 2 vertices. 

From the above observations, the family of forbidden hypergraphs of (pQ) can be equivalently defined as 
below. 

T-L r {k) = {H : H is a simple r-uniform hypergraph with i edges and i—1 vertices for r + 3 < i < k} (4) 

A lower bound for m(n, r, k) was given in [15], where the authors obtained the following result using a 
simple probabilistic argument: 

m(n,r,k) = ^(n^ 1 ). (5) 

In [17] . the authors using a method (method of alteration, same as in the proof of ([3])) of [4], improved 
this lower bound: 

kr 

m(n, r, k) = Q,(n k - 1 ). (6) 
On the other hand, in [17] . the authors obtained the following upper bound: 

m (v,*)<|^("). (7) 

Now, it is trivial to see that this bound is met exactly for r = 1. In [17] . it was shown by explicit 
construction, that this bound is tight (in exact terms) for the cases r = k — 1 and r = k — 2. Indeed, k — 1 
copies of Kn ^ and k — 2 copies of K^t 2 \ respectively do the job. 
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So, ([7|) essentially shows an upper bound 0{n T ) for m(n, k,r). We are interested in the values of r in 
the range 2 < r < k — 3, for k > 5. 

In this article we obtain the following results. 

(i) In Section [2j we improve the upper bound ([7]) in an asymptotic sense. In particular, using a result 
due to Erdds (|12j) we are able to show m(n,k,r) = o{rf) for 7 < k, and 3 < r < k — 1 — [log A;]. 
This result is best possible with respect to the upper bound on r as we subsequently show through 
explicit construction that for 6 < k, and k — [log k~\ < r < k — 1, m(n, r, k) = B(n r ). 

This explicit construction improves on the general lower bound ([3]) obtained by Brown, Erdos, and 
Sos for the parameters p = k — l,q = k,k — [log k~\ < r < k — 1, where k > 6. In this case their lower 

fcr 

bound is the same as ([6]), i. e., ^(n*- 1 ). 

(ii) In Section 5, we deal with the graph case, i. e., 2-uniform CBCs and obtain the following results. 

(a) We provide exact value of m(n, 2, 5) for n > 5. 

(b) Using a result of Lazebnik et al. [2] regarding maximum size of graphs with large girth we 

fc+i 

improve the existing lower bound (Vl{n k - 1 )) of [17] for all k > 8 and infinitely many values 
of n. 

(c) We show m(n, 2, fe) = 0(n ^ J ) by using a result due to Bondy and Simonovits, and also show 
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m(n, 2, k) = 0(n2 ) for A; = 6, 7, 8 by using a result of Kovari, Sos, and Turan. 

2 r-UNIFORM CASE FOR V > 3 

We begin this section by stating the following result due to Erdos ( which is a generalization of a result of 
Kovari et al. [16J) that will be crucial in our proof of Theorem 12.21 

Theorem 2.1 ([E]). Let n > no(r,l),l > 1. Then for sufficiently large C (C is independent ofn,r,l), 

n r ~T^ < ex(n,K (r) (l,...,l)) < n r ~W^ . 



We first show that m(n,k,r) = o(n r ) for 7 < k, and 3 < r < k — 1— [log A;]. All the logarithms 
mentioned in this paper are to the base 2. 

Theorem 2.2. Let k > 7, and 3 < r < k — 1 — [log A;]. Then for sufficiently large n (n > no(r)), 
i 

m(n,r,k)<n . 

Proof. Let 7 < u < k, 1 < v < u — 2[logn] be such that r = u — v — [log it]. It is clearly possible to 
find such u, v for the range of values of r as in the hypothesis. Consider the r-uniform, complete r-partite 
hypergraph H = (V, J 7 ), where 

V := {xi, . . . , X u _ v _ 2 [log n] ? ■ ■ ■ ) x ri 2/1) ■ ■ ■ i Uu— v— 2 [log u] ' ■ ■ ■ ' 2/r} 

and 

T := \ {zi,. . . ,z r } : Zi e {xi,yi}, 1 <i <r\. 
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Next, consider the r-uniform sub-hypergraph H' = (V', J 7 ') of H, where 



V 5 V : — {x\, . . . , X u -v-2\\ogu\ j • • • ) x r, Uu-v-2\logu} + li • • • j 

and 

J 7 ' := . ■ ,x u _ v _ 2 \iogu], z u -v-2\io g u]+i, ■ ■ -,z r } ■ Zj G {xj,yj},u- v - 2 flog n] + 1 < j < r\. 

Since v > 1, 

|V'| = u-v<tt-l, (8) 



and 

\T'\=^ lo ^>u. (9) 

So, % is a forbidden hypergraph required in Theorem 12.11 containing a forbidden hypergraph %' of the 
collection T-L r (k) of (P). Hence, it follows from Theorem 12.11 that for sufficiently large n, i. e., for n > no(r), 
m(n,r,k) < ex(n,T-L') < ex(n,7i) < n r ~v F=J . □ 

A few remarks regarding the theorem are in order. 



Remark 2.2.1. 1. Each edge of J- 1 has the fixed set of vertices {x%, . . . , x u _ v _2\\ g U -\ }• This choice is 
arbitrary and any fixed set of u — v — 2 [log u] vertices {z\ , ■ ■ ■ , z u „ v _2\iog u] } could have been chosen 
maintaining the condition Zj G {xj, yj}, 1 < j < u — v — 2[logn]. 

2. One can also see that the same construction with partite sets of size /, in conjunction with Theorem 
12.11 gives us a corresponding result for r < k — 1 — (I — 1) [log; k~\ . 

3. Inequalities ([8]) and (|9]) are tight when u is a power of 2 and v = 1. In particular, when /c is a power 
of 2 and r = k — 1 — log k, we have | V'| = k — 1 and \ J-'\ = k. So, k edges of %' span exactly k — 1 
vertices. 

Theorem 2.3. m(n, r, k) = Q(n r ) for 6 < k, k — [log k~\ < r < k — 1. 

Proof. Here we show m(n,r,k) = Q(n r ) for the stated ranges of values of r and k. This, together 
with m(n,r,k) = 0(n r ) from ([7]), would imply m(n,r,k) = @(n r ). Also, first we prove the above for 
r = k — [log k~\ , as this turns out to be the tight case and the same argument can easily be applied for the 
rest of the range of values of r. Note that the cases r = k — 1 and r = k — 2 have already been settled in 
[I7j. 

Construction: Consider the complete r-uniform, r-partite hypergraph T~L = (V,J~), where V = Vi U V2 U 
. . . U V r , such that Vi n Vj = for i / j, and |V«| = L 1 ^ 1 ], for 1 < i < r. Clearly \T\ = fl{n r ). 

Claim. % does not contain a sub-hypergraph H' = (V',J- r ) such that \V'\ = i — 1 and \J-'\ > i for 
r + 3<i<k. 

Proof. First, we observe that any sub-hypergraph T-C = (V, J-') such that |V'| = i — 1 and [J 7 '! > i for 
r + 3 < i < k, there is another sub-hypergraph H" = (V", J 7 "), such that C H" C "H, |V"| = - 1, 
and jJ 7 "! > A;. To get from W we simply add k — i edges to J 7 ', where each edge contains exactly one 
unique vertex not contained in V', i. e., these k — i unique vertices are distinct for these distinct k — i 
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added edges. This is always possible due to the structure of H provided there are k — i distinct vertices 
in V \ V. But this can be safely assumed because n is large enough (due to the asymptotic nature of the 
problem), in fact, n > 2k would suffice. Due to this, it is sufficient to establish that H does not contain a 
sub-hypergraph %' = (V", F') such that \V'\ = k — 1 and > k. 

Next, let us consider any V C V' with \V\ = k - 1. Then V = V\ U V' 2 U . . . U V' r , where V; C V i; for 
1 < % < r. Also \V'i\ > 1 (for otherwise T' would be empty, trivially prooving the claim), S[ =1 |V"i| = |V'| = 
k- 1, and \T'\ = n[ =1 |V'i|. Here, it is important to note that \{V'i : |V';| = 1, 1 < i < r}\ > 2r - k + 1 > 1 
for 6 < k, and r = k — [log fc] . 

Now, observe that maximum of \J-'\ is attained when there are exactly (k — r — 1) V'jS, 1 < i < 
with |V'i| = 2, and for the remaining 2r — k + 1 V'^s, |V'i| = 1. This can be seen by the following shifting 
argument. 

Without loss of generality let us assume that |V'i| = 1 and | V'2 1 = I, where I > 3. Now, it is easy to 
see that by shifting a vertex from V'2 to V'i, | V'i 1 1 V2 1 becomes 2(1 — 1), which is > I for / > 3. 

So, following the above argument, maximum of is attained when \V'i\ are as equal as possible 
for 1 < i < r, and since as observed earlier we always have |{|V'j| = l,l<i<r}| > 1, it follows that 
\V'i\ G {1,2} for 1 < i < r. This implies \T'\ < 2 k - r ~ 1 = 2 n°e ^1 - 1 < k - 1. This proves the claim. 

Now, applying the same argument for the cases k — [log /c] < r < k — 1, one can easily see that 
\P\ < 2 fe -r-i < 2n°g fc l- 1 < k - 1. Hence, the theorem. □ 

3 2-UNIFORM CASE 
For 2— uniform (graph) CBCs, we provide the following improvements over existing results. 

4 An Exact Result 

2 

Theorem 4.1. m(n,2,5) = [^J for n > 5. 

Proof. We first remark that in [T7] , Paterson et al. observed that the complete bipartite graph on n vertices 

2 

satisfies the batch condition for k = 5, and hence concluded that m(n, 2, 5) > [^J . Hence it suffices to 

2 

prove that m(n, 2,5) < L^J. In particular we only need to show that any graph with n vertices and at 

2 

least L^J + 1 edges contains a subgraph with 4 vertices and 5 edges. 

We prove this by induction on n. This is clearly true for n = 4. Now suppose we have a graph with 

2 

n vertices satisfying the given condition. We may assume that the graph has exactly [^-J + 1 edges by 
throwing any extra edges away. Observe that the graph contains a vertex of degree at most [^J . Removing 

this vertex along with all its incident edges leaves a graph with n — 1 vertices and at least [— + 1 
edges, which by the induction hypothesis contains a subgraph with 4 vertices and 5 edges. □ 

5 Improvement of the lower bound 

For improvement of the lower bound we need the following lemma whose proof, although not difficult, is 
provided here for the sake of completeness. 

Lemma 5.1. Let k > 6 be a positive integer. If a graph has k edges and at most k — 1 vertices then it has 
girth at most [^J . This bound is tight. 
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Proof. For k = 6 the statement is clear. Suppose the statement does not hold for some k > 6. Pick k 
minimum so that the statement does not hold, i.e., suppose that for some k we have a graph G with k 
edges, and at most k — 1 vertices, and the firth of G is at least > [^yj + 1- We may assume that the graph 
is connected. Since G has k edges and at most k — 1 vertices, it contains at least 2 distinct cycles, C\, C2, 
say. Now, if C\ and C2 are edge disjoint then one of them will have length at most |_§J < |_irJ which 
contradicts the assumption regarding k. 

If C\ and C2 are not edge disjoint let Iq = \E(Ci)nE(C2) \ be the number of common edges and h,fa be 
the number of edges that exclusively belonging to the Ci, and C2 respectively. Clearly, Iq + h = l-E'(Ci) | , 
and lo + 1% = \E(C2)\- Now, consider the subgraph of G consisting of the edges of C1AC2; every vertex in 
this subgraph has even degree, so in particular it contains a cycle. By the assumption on the girth of G, 
we have 

U + Ij > LyJ + 1, 

for < i < j < 2, which gives 

^0 + h + h > k, 

a contradiction. 

This bound is best possible since we may consider two vertices joined by three pairwise vertex-disjoint 
paths of length | (for k a multiple of 3) to give a tight example. □ 

Next we mention the following result of Lazebnik et al. and apply it to improve the lower bound [6] on 
m(n, 2, k) of [IT], which in this case is ^(n k ~ 1 ), for all k and infinitely many values of n. 

Theorem 5.2 (|14J). For s > 2, ex(n, {C 3 , C 4 , . . . , C 2s +i}) = n(n^^+^) for infinitely many values of 
n, where e = if s is odd, and = 1 if s is even. 

Corollary 5.3. Let k > 8 then 



m(n, 2, k) = < 



k — S 


ifk 


= 5 


mod 6 








fc — 2 


ifk 


= 2 


mod 6 


or k 


= 4 


mod 6 


fc — 1 


ifk 


= 1 


mod 6 


or k 


= 3 


mod 6 




ifk 


= 


mod 6 









for infinitely many values of n. 

Proof. The proof follows directly from Lemma 15.11 and Theorem 15.21 We remark here that for the cases 
where k = mod 3 or k = 1 mod 3 the bounds of the corollary may be improved as Lemma 15 . 1 1 requires 
the girth to be [^J + 1, which is odd in these cases. Whereas the bounds obtained were by applying 
Theorem 15.21 for graphs of girth [^J + 2. 

6 Improvement of the upper bound 

Next, we show improve the upper bound on m(n, 2, k). We begin with the following theorem due to Bondy 
and Simonovits. 

Theorem 6.1. flSj) If in a graph of order n, the number of edges > 100fcn 1+ fc , then the graph contains a 
C21 for every I £ [k, kn^]. 

Theorem 6.2. For k > 4,m(n, 2, k) = 0{n 1+ P), where (3 = -A-r. 
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Proof. Here we show that m(n, 2, k) < 100A;n 1+ ^. Consider a graph G with 100fen 1+ ^ edges. Now, it is 
well known that any graph has a subgraph whose minimum degree is at least half of the average degree of 
the original graph. Hence, G has a subgraph H with minimum vertex degree at least lOO/crcA 

Now, Theorem 16.11 implies that H has a cycle C of length at most 2[j\. Let v G C be an arbitrary 
vertex, and consider all the walks of length |_fj in H starting at v in which no edge repeats consecutively 
in the walk. It is easy to see that the number of such walks is 

100fen /3 (100fcn /3 - l)^ -1 > n. 

Consequently, there is a vertex v' , such that at least two distinct walks of length |4j starting at v, terminate 
at v' . These two walks along with C give rise to a forbidden configuration consisting of at most k edges 
spanning at most k — 1 vertices. □ 
Theorem 16.21 leads to trivial upper bound 0(n 2 ) on m(n, 2, k) for k = 6,7. However, it turns out that 
we can improve this trivial upper bound on m{n, 2, k)ior k = 6, 7 by the following well-known theorem due 
to Kovari et al. whose generalization is Theorem 12.11 

Theorem 6.3 (PS], see also [2]). Suppose 2 < s, 2 <t. Then 

ex(n,K(s,t)) < -(s — l)?(n — t + l)n x ~£ -f _(£ - l) n . 

3 

Corollary 6.4. m(n, 2, k) = 0(na ) for k = 6,7, 8. 
Proof. 

3 

(a) Theorem 16.41 clearly implies an upper bound of 0{n^) for ex{n,K{s,2)). More precisely, it implies 

3 

there is a constant c s> 2 such that for all n > uq, any graph of order n with more than Cg^n? edges will 
contain a K{s, 2). Now, s = [§"],(& > 6) serves our purpose. Because if([|],2) has > k edges and 
< k — 1 vertices for k > 6, and hence contains the forbidden structure of k edges spanning < k — 1 
vertices as subgraph. This implies m(n, 2, k) = 0{n^) for k > 6. 

To prove the asymptotic tightness of the upper bound for the cases k = 6,7, 8, we note that Lemma [5. II 
implies a graph which is {C3, C4, Cs}-free, gives rise to 2-uniform CBC with retrievability parameter 
k for k < 8. Now, it is well-known that for q a prime power, the incidence graph of PG(2, q) is a 
q + 1-regular bipartite graph with 2{q 2 + q + 1) vertices and girth at least 6. In fact, and it was shown 

3 

in [13], for sufficiently large n this construction leads to a graph on n vertices having f2(n2) whose 
girth is at least 6. Incidentally, this construction also improves on the non-constructive lower bound 
of © for m(n, 2, 7) and m(n, 2, 8). So, finally we have m(n, 2, k) = £l(n,2 ) for k = 6,7, 8. 

□ 

7 Concluding Remarks 

While the draft was under preparation, the authors were informed by C. Bujtas and Z. Tuza |llj about 
their simultaneous and independent work in similar direction. The family of forbidden configurations they 
consider is the following. 

H r (k,q) = {H : H is r-uniform A \E{H) \ - \V{H)\ =q+\M< \E(H)\ < k}, 

where r > 2,q > —r + l,k > q + r + 1 are fixed integers, and V(H) and E(H) denote respectively the 
number of vertices and edges of the hypergraph H. 
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Their results improve on the upper bound on m(n, r, k) given in Theorem 12. 21 for some ranges of values 

!+-r- 

of r. Further, their results show that m(n, 2, k) = 0(n ) which clearly improves on our upper bound 
of 0(n W ), and also imply, together with Corollary 15.21 the following asymptotically exact results: 

m(n, 2, k) = 0(nf ), for k = 9, 10, 11. 
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